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Abstract. The application of Stein's method for distributional ap- 
proximation often involves so called Stein factors (also called magic 
factors) in the bound of the solutions to Stein equations. However, 
these factors sometimes contain additional terms such as a logarithmic 
term for Poisson point process approximation, leading to unsatisfactory 
estimates. Despite the fact that is has been shown for many of these 
Stein factors that the known bounds are sharp and thus that the ad- 
ditional terms cannot be avoided in general, no probabilistic examples 
have been presented in the literature which justify these Stein factors. 
In this article we close this gap by constructing such examples more or 
less explicitly. As a side effect, a new interpretation of the solutions to 
Stein equations is given. 



1. Introduction 

Stein's method for distributional approximation, introduced by Stein ( 19721 ). 
has been used to obtain bounds on the distance between probability measures 
for a variety of distributions in different metrics. There are two main steps 
involved in the implementation of the method. The first step is to set up 
the so-called Stein equation and to obtain bounds on its sol utions and t heir 
derivatives or differences, either analytically, as for ex ample Stein (Il972l 'l. or 
by means of the probabilistic method introduced by Barboui (jl988l ). Us- 
ing these bounds, one then tries in a second step to bound an expecta- 
tion involving the so-called Stein operator. There are vario us techniques to 
achiev e this, such as the local approach by Steinl (1972) and lChen and Shao 
2004 ) , the exchangeable pair coupling by ISteinI |l986l ) (but see also iRollin 
2006ll on how to rem ove t he exchangeability co n dition ), size biasing by 



Barbour et al. _j 1992b ) and iGoldstein and Rinott (Il996ll. zero bia sing by 
Goldstein and Reinert ( 19971 ). w-functions by Cacoullos et al. ( 19941 ) and re- 
lated work. 

In the first step, the so-called Stein factors often play a crucial role in 
the bounds of the solutions to the Stein equation. In this article the term 
Stein factor is used to refer to the asymptotic behaviour of these bounds as 
some of the involved parameters go to infinity or to zero, using the usual 
0-notation. Many of the known factors are not satisfactory, because they 
contain additional terms which often lead to non-optimal bounds in the 
applications, and usually much additional work in both steps has usually 
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to be carried out to circumvent this problem; see for example iBrown et al 



There are also situat ions where the s olution s can grow exponentially 



fast, a s has been shown by Barbour et alJ ( 1992al ) and Barbour and Utev 



( 19981 ) for some specific compound Poisson distributions which greatly limits 



the usability of Stein's method in these cases. 

To make matters worse, for many of the Stein fac t ors it h as been shown 
that th ey ca nnot be i mprov ed; see iBrown and lOal (I199RI 1. iBarb our et al 



(Il992ah and iBarbouil tOO^ . However, these articles do not address the 



question whether the problematic Stein factors express a fundamental flaw 
in Stein's method or whether there are examples in which these additional 
terms are truly needed if Stein's method is employed to express the distance 
between the involved probability distributions in the specific metric. 

The purpose of this note is to show that the latter statement is in fact true. 
We will present a general method to construct probability distributions; this 
construction not only explains the presence of problematic Stein factors, but 
also gives insight into Stein's method. 

The paper is organised as follows. In the next section, we recall the 
general approach of Stein's method in the context of Poisson approximation 
in total variation to demonstrate the basic construction of the examples. 
Then, in the remaining three sections, we apply the construction to the 
multivariate Poisson distribution, Poisson point processes and compound 
Poisson distribution, as in these cases the Stein factors are important to 
obtain good bounds in the applications. 

2. An illustrative example 

As a first illustration on how to construct examples which make full use 
of Stein factors and also to recall the basic steps of Stein's m ethod, we start 



with th e Stein-Chen method for Poisson approximation (see iBarbour et al 
(ll992bl )V 



Let the total variation distance between two integer-valued random vari- 
ables W and Z be defined as 

dTY{^{W),^{Z)) := sup |E/i(VF)-E/i(Z)|, (2.1) 

where the set Hty consists of all indicator functions on Z. Assume that 
Z ~ Po(A). Stein's idea is to replace the difference between the expectations 
on the right hand side of (|2.ip by 

]S{ghiW + l)-Wgh{W)}, 

where gh is the solution to the equation 

^9hij + 1) - m(j) = Hj) - miz); j e Z+. (2.2) 

The left hand side of (|2.2p is an operator that characterises the Poisson 
distribution; that is, for Ag{j) := \g{j + 1) — jg{j), 

E^5(y) = for all bounded 5 y ~ Po(A). 



Assume for simplicity that W has the same support as Po(A). With (|2.2|) 
we can now write (12.111 as 



dTv(^(W),Po(A)) = sup \]EAgh{W)\. (2.3) 

heHTV 



ON STEIN FACTORS AND EXAMPLES WITH SHARP RATES 



3 



It turns out that (|2.3I) is often easie r to bound than (I2.1I1. 



Barbour and Eagleson ( 19831 ) and Barbour et al. (jl992bl ) showed that, for 



all functions h E Ti-TV, 



\9h\\ 5^ 1 A 




\Agh 



A 



(2.4) 



where || • || denotes the supremum norm and Ag{j) := g{j + 1) — g{j)- Here, 
the Stein factors are A~^/^ and A~^, respectively, if one is interested in the 
asymptotic A — > oo. With this we have finished the first main step in Stein's 
method. 

As an example for the second step and also as a motivation for the main 
part of this paper, assume that is a non-negative integer- valued random 
variable and assume that r is a function such that 

]S{{W - X)g{W)} = 'E{T{W)Ag{W)} (2.5) 

for all bounded functions q: see ICacoullos et"aD(ll994l ) and lPapathanasiou and UtevI 

i'or more details on this approach. To estimate the distance between 
^iW) and the Poisson distribution with mean A, we simply use (|2.3p in 
connection with (12.511 to obtain 

sup \Agh{W)\ 

heHTv 

sup \]S{XghiW + l)-Wgh{W)}\ 
sup \E{XAgh{W) - (W - X)gh{W)]\ 
sup \E{{X-T{W))Agh(W)}\ 

heHTV 



dTv(^(^),Po(A)) 



(2.6) 



1 - e-A 



A 



^T{w)-x\ 



where for the last step we used (|2.4|) . Thus, (|2.6p expresses the ^TV-distance 
between ^(W) and Po(A) in terms of the average fluctuation of r around 
A. It is easy to show that r = A if and only if ~ Po(A). 

Assume now that for a fixed positive integer k, r(w) = A -|- 5k{w), where 
6k{w) is the Kronecker delta, and assume that Wk is a random variable sat- 
isfying (|2.5|) for this r. In this case we can in fact replace the last inequality 
in (|2.6|) by an equality to obtain 



dTY{^{Wk),PoW) =-p[Wk = k] sup \Agh{k)\. (2.7) 

heHTv 

From Equation (1.22) of the proof of Lemma 1.1.1 of Barbour et al. ( 1992bl ) 
we see that, for k = [AJ , 

sup \Agh{k)\ X A-i 
heHTv 

as A — > oo. Thus, (|2.7p gives 



(2.8) 



dTv(^(VFfc),Po(A)) X F[Wk = k]X-^ (2.9) 

as A ^ oo. Note that, irrespective of the order of F[Wfc = /c], the asymptotic 
pj]) makes full use of the second Stein factor of (EH). To see that ^{Wk) 
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in fact exists, we rewrite (|2.5p as ]EiI3g{Wk) = 0, where 
Skgiw) = Ag{w) + 5k{w)Ag{w) 

= {X + 6k{w))g{w + l)- {w + 6k{w))g{w). 
Recall from Barbouil (jl988l ). that A can be interpreted as the generator 



of a Markov process; in our case, as an immigration-death process, with 
immigration rate A, per capita death rate 1 and Po(A) as its stationary 
distribution. Likewise, we can interpret as a perturbed immigration- 
death process with the same transition rates, except in point k, where the 
immigration rate is increased to A -|- 1 and the per capita death rate is 
increased to 1 + 1/ k. Thus, ^{Wk) can be seen as the stationary distribution 
of this perturbed process. 

If A; = [AJ , the perturbation of the transition rates at point k is of smaller 
order than the transition rates of the corresponding pure immigration-death 
process in k. Thus, heuristically, F[VFa: = k] is of the same order as the 
probability Po(A){/c} of the stationary distribution of unperturbed process, 
hence F[M4 = A:] x A~^/^, and (|2.9p is of order A~^/^. We omit the rigorous 
proof of this statement. 

Remark 2.1. Note that by rearranging (|2.7p we obtain 

d^y{j^{Wk),J^{Z)) 

sup \Agh{k)\ = ^-y— 2.11 

for positive k. We can assume without loss of generality that g/t(0) = gh{l) 
for all test functions h because the value of 5^(0) is not determined by (12.21) 
and can in fact be arbitrarily chosen. Thus A(7/i(0) = and, taking the 
supremum over all k G we obtain 

sup Aff,, =sup ^—T— . 2.12 

h&HTv fc^i ^Wk = k\ 

This provides us with a new interpretation of the bound ||Ag(ft||, namely as 
the the total variation distance (or whatever metric or set of test functions, 
respectively, is under consideration) between some very specific perturbed 
Poisson distributions and the Poisson distribution, relative to the probability 
mass at the loc ation o f thes e perturbations. 
Let us quote Chen ( 19981 ). page 98: 



Stein's method may be regarded as a method of constructing cer- 
tain kinds of identities which we call Stein identities, and making 
comparisons between them. In applying the method to probability 
approximation we construct two identities, one for the approximat- 
ing distribution and the other for the distribution to be approx- 
imated. The discrepancy between the two distributions is then 
measured by comparing the two Stein identities through the use 
of the solution of an equation, called Stein equation. To effect the 
comparison, bounds on the solution and its smoothness are used. 

Equations (12. lip and (I2.12p make this statement precise. They express how 
certain elementary deviations from the Stein identity of the approximating 
distribution will influence the distance of the resulting distributions in the 
specific metric, and they establish a simple link to the properties of the 
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solutions to (|2.2I) . We can thus see W from (|2.5p as a 'mixture' of such 
perturbations which is what is effectively expressed by Estimate (|2.6p . 



Thus, to understand why in some of the applications the Stein factors are 
not as satisfying as in the above Poisson example, we will in the following 
sections analyse the corresponding perturbed distributions in the cases of 
multivariate Poisson, Poisson point processes and compound Poisson distri- 
butions. 

Unfortunately, in the multivariate setting, the perturbations needed to 
obtain an equation of the form (|2.7p are not as straightforward. The attempt 
to simply add the perturbation as in (I2.10p . will in general result in an 
operator which is not interpretable as the generator of a Markov process. 
However, under suitable symmetry assumptions and a slight modification of 
(|2.10|) . we can still use the generator interpretation and arrive at an equation 
of the form ([TTp . 

3. Multivariate Poisson distribution 

Let d > be an integer, fj, = (//i, . . . , /x^) G M!^ and A > 0. Let Po(A/i) be 
the distribution on defined as Po(A//) = Po(A/ii)c?i- • ■ (^Fo(Xuri ). Stein's 
metho d for multivari ate Poisson approxim ation was introduced by iBarbour 
( 19881 ): but see also Arratia et al.l (|1989l ). Let e^*) denote ith unit vector. 
Using the Stein operator 

d d 

Ag{w) ■.= Y,^M9{w + e^''^) - g{w)} +Y,M9iw - e^'^) - g{w)} 

1=1 i=l 

for w S Z^^, it is proved in Lemma 3 of iBarbouil ( 19881 ) that the solution qa 
to the Stein equation AgA{w) = 5a{w) — Po(A)u){j4} for A C Z"^, satisfies 
the bound 

\^ . ri + 21og+(2A) ^^a2 ^ A 
^a,a,Ai,gA < mm| 2A z2f^l2''U (^.1) 

i,j=l 1=1 1=1 

for any a G M!^, where 

Aijg{w) := g{w + e^'^ + e^-''^) - g{w + e^'^) - g{w + e^^^) + g{w). 
Let now = [A/ijJ for i = 1, . . . , d and define 

^1 = {u; G : ^ u-i ^ mi, ^ u'2 ^ m2}. (3.2) 
Barbouil hm± proves that, if fii,^2 > and A ^ (e/327r)(/ii A fi2) ^, then 

1^129 Adw)\ > J^^^— (3.3) 

for any w with {wi,W2) = (mi, 7712). It is in fact not difficult to see from the 
proof of (|3.3|) that this bound also holds for the sets 

A2 = {w e 'E'l : wi > mi, ^ W2 ^ 1112}, 

As = {w £ Ij'I : ^ wi mi, W2 > 1712}, 

A4^ = {w £ Z'^ : wi > nil, W2 > ^2}- 
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Figure 1. Illustration of the perturbed process defined by the 
generator l|3.4p . Between any of two connected points on the lat- 
tice Z^, we assume the transition dynamics of a pure immigration- 
death process, that is, in each coordinate immigration rate A/Xj 
and per capita death rate 1. The arrows symbolise the additional 
perturbations with respect to the pure immigration-death process; 
each arrow indicates an increase by 1 of the corresponding tran- 
sition rate. The resulting differences of the point probabilities 
between the equilibrium distributions of the perturbed and un- 
perturbed processes are indicated by the symbols -I- and — . The 
corresponding signs for each of the quadrants can be dedu ced using 
the St ein equation, Equation p.6p . and Equation (2.8) of lBarbouii 
|2003). 



Example 3.1. Assume that is a random variable having the equilibrium 
distribution of the d-dimensional birth-death process with generator 

BK9iw) = Ag{w) + 6j^^^j2) {w) [g{w + e^^)) - g{w)] 
+ 5K_^^m{w)[g{w - e^^'^) - g{w)] 

d 

= Y,{\^^^ + h{i)5K+ei^){w))[g{w + E^'^)-g{w)\ (3.4) 

i=l 

d 

+ Y^iwi + 5i{i)5^^^w{w)) [g{w - e«) - g{w)\ 

i=l 

where K = (mi, m2, . . . , m^). Assume further that = /i2, thus rrii = m2. 
See Figure [T] for an illustration of this process. 

Lemma 3.1. Under the above assumptions, J^{W) is invariant under in- 
terchange of its first two components. 
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Proof. Let D = and partition D into the disjoint sets 

Dl = {w = iwi,W2, ... ,Wd) e D : wi < W2} 
Du = {w = {wi,W2, . . . ,Wd) e D : wi > W2} 
Dm = D\{DlUDu). 

Recall that the stationary distribution {tt{w); w G D} for the Markov process 
with generator given by (13.41) is the solution to the (infinite) set of linear 
equations 

i'{w)t:{'w) = '^^ q{v,w)T:{v) (3.5) 

for w G D, where v ^ w denotes summation over the neighbouring sites of w 
(two points being neighbours if the differ by 1 in exactly one coordinate), and 
where v and q are the total transition rates out of a state and the transition 
rates between neighbouring states, respectively, defined by (|3.4I) . It is now 
easy to check that the equations (|3.5p for w £ are the same as for w S Du 
but with the first two coordinates interchanged, because /ii = ^2 and thus 
K G Dm- Symmetry follows now from this and the observation that the set 
of equations (|3.5|1 for w S D^ that define the probabilities {■k{w);w G Dl} 
depend only on the transition rates within Z)^, the transition rates between 
Dl and -Dm, and the probabilities {Tr(w);w E Dm}, as there are no direct 
transitions between Dl and Du. □ 

Now, noting that for any bounded g we have lEBKg{W) = 0, 

EAg{W) = EAg{W) - ISBRgiW) 

= -]p[W = K + [g{K + e(2) + _ ^(^ + e^)! 

- ¥[W = K + e(^)] [g{K) - g{K + e^^))] 
= -F[M/ = i^ + e(i)]Ai25(i^), 



(3.6) 



where we used ¥\W = K + = V\W = K + e^^)] ^.j^g ^^^^ equality 
which follows from Lemma 13.11 as K + e^^^ , when the first two coordinates 
are interchanged, is equal to K + e^^^ . Thus 

dTv(^(VF),Po(A^)) = sup \^Agh{W)\ 

heHTv 

= F[M^ = K + e«] sup \Ai2gh{K)\ 
heHTv 

^ F[W = K + £(^)]logA 
^ 20A^/ii/X2 

On the other hand, from (13.111 for a = e^^^, a = e^^^ and a = e^^^ + e^^) 
respectively, it follows that 

, „ ^ (l + 21og+(2A))(Mi + M2) 
2\nifi2 
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This yields the upper estimate 
dTv(^(VF),Po(A/u)) = ¥[W 



+ sup \^i29h{K)\ 



^nW = K + ,(i)] (l + 21og^(2A))(/.i+/x.) 



and thus we finally have 

dTv(^(W^),Po(A/i)) 



¥[W = K + e(^)]logA 



A 



(3.7) 



Now, again heuristically, F[VF = K+e^^^] will be of the order Po(A//i){mi} x 
• • • X Po(A^rf){mrf} X A'='/2 ^ SQ tijat JMl) will be of order log A/A^+^Z^^ 

Recalling that the test function (13. 2p and also the corresponding test func- 
tions for the other three quadrants are responsible for the logarithmic term 
in (13.7(1 . we may assume a situation as illustrated in Figure [1] for d = 2. 
Different to the one-dimensional case, where the perturbation moves prob- 
ability mass from the point of the perturbation to the rest of the support 
in a uniform way, the perturbations of the form (|3.4|) affect the rest of the 
support in a non-uniform way. Further analysis is needed to find the exact 
distribution of the probability mass differences within each of the quadrants. 

Note that the perturbation (|3.4|) is 'expectation neutral', that is W has 
expectation A/i which can be seen by using ]EBg{W) = with the function 
gi{w) = Wi for each coordinate i using in addition Lemma EH] if i = 1. 



4. POISSON POINT PROCESSES 



Stein's meth od f or Poisson point process a pproximation was derived by 
Barbour (jl988l ) and Barbour and Brown ( 19921 ). They use the Stein operator 



Mi) = ^ W + - 9(0] Hda) + ^ [g{C -6^)- g{0] ^da), 

where is a point configuration on a compact metric space T and A de- 
notes the mean measure of the process. The most successful approxima- 
tion results have be e n obt ai ned in the so - called d^-m etric; see for exampl e 
Barbour and Brownl (Il992l l. iBrown et all tOQii ) and ISchnhmacheil HqoB). 
Assume that F is equipped with a metric do which is, for convenience, 
bounded by 1. Let be the set of functions / : F — > R, satisfying 

fix) - f{y) 



sup 



^ 1. 



do{x,y) 

Define the metric di on the set of finite measures on F as 



di{i,v) 



ife(F)/r?(F), 
e(F)-i sup j fdi-j fdv if m = r?(F). 



Let now Ti.2 be the set of all functions from the set of finite measures into IR 
satisfying 

Kv) - HO 



sup ■ 



di{i,7i) 



^ 1. 
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Figure 2. Illustration of the perturbed process defined by the 
generator l|4.4p using the same conventions as in Figure [TJ The 
corresponding signs can be obtained through the Stein equation, 
equation l|4.5p and the represe n tation of the solution of the Stein 
equation as in iBrown and Xial (1995), for the different test func- 
tions (l42l). 



We then define for two random measures ^ and ^ on F the d2-iQetnc as 
d2(^($),if(^)) := sup |E/i($) -E/i(^)|; 



for more details on the (i2-nietric see Barbour and Brownl (Il992l ). 

If /t g TCy and O h solv es the Stein equation AghiC) = ^(C) ~ Po(A)/i, 
Barbour and Brown ( 19921 ) prove the uniform bound 

\\Ao.pghm ^ 1 A 1 + 21og+f^l Y (4.1) 



where 



2|A| 



Brown and Xial (Il995l l that the log-term in (14.1(1 is 



Brown et al. (j2000l ) have shown that it is possible to 



It has been shown by 
unavoidable. However, 
obtain results without the log using a non-uniform bo und on Aa^pgh- 

Following the construction of Brown and Xia (jl995l ). assume that F = SU 
{a}U{6}, where 5 is a compact metric space a and b are two additional points 
with do{a,b) = do{b,x) = dQ{a,x) = 1 for all x £ S. Assume further that 
the measure A satisfies A({a}) = A({6}) = 1/|A| and thus A(5) = |A| — 2/|A|. 
For ma,mh £ {0, 1}, define now the test functions 



HO 



else. 



(4.2) 



It is shown by direct verification that h G 0.2- Brown and Xia (Il995l ) prove 
that, for rua = mi, = 1, the corresponding solution gh to the Stein equation 
satisfies the asymptotic 

(4.3) 



\^ab9h 



|A| 



as |A| ^ oo, so that (|4.ip is indeed sharp, but it is easy to see from their 
proof that (|4.3p will hold for all other possible values of rUa and m^, as well. 
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Example 4.1. Let T and A be as above with the simphfying assumption 
that S is finite (in order to apply Lemma ISTT]) . Let ^' be a random point 
measure with equilibrium distribution of a Markov process with generator 

BogiO = Agio + iO W + h)- g{i)] + 6s, (6 [giC-6b)- giO] 

= ^ + So.) - g{0] (A + 5s^ m){da) ^4 4^ 

+ ^ [giC - Sa) - g{0] (e + 5s,iOSb){da). 

See Figure [2] for an illustration of this process. 

Note that, as S is finite, we could rephrase the example in terms of the 
multivariate Poisson distribution as in Section [3l but now using another 
(weaker) metric and also a different assumption on A. Where as in Section 
[3] we assumed that the mean of each coordinate is of the same order |A|, 
we assume now that there are two special points a and b with vanishing 
expectation mass attached to them. However, the immigration rates at the 
two coordinates a and b are the same, which allows us to apply Lemma 13.11 

Now, for any bounded function g, 

E^5(*) = E^5(^') - E.Bogi'if) 

= -F[$ = Sb] [g{6a + 6b) - g{5a)] - = 5a] [g{Sa) - g(,0)] 
= -F[^ = 6a]Aabg{0), 

(4.5) 

where Lemma [XT] was used for the last equality. Thus, using (|4.3p . 

d2(^(^),Po(A)) = = 6a] sup \Aabghm ^ = '^"J 1^1 . (4.6) 

h£H2 1^1 

Figure[2]illustrates the situation for |r| = 3. If the process <I>t is somewhere 
on the bottom plane, that is ^{S) = 0, it will most of the times quickly 
jump upwards, parallel to the 5-axis, before jumping between the parallels, 
as the immigration rate into S is far larger than the jump rates between the 
parallels. Thus, because of the perturbations, probability mass is moved — 
as illustrated in Figure [2] — not only between the perturbed points but also 
between the parallels. Although indicator functions are not in Ti.2, the test 
functions in (14. 2|) decay slowly enough to detect this difference. 



Remark 4.2. Note again, as in Example 13.11 that the perturbation in the 
above example is neutral with respect to the measure A. It is also interesting 
to compare the total number of points to a Poisson distribution with mean 
|A| in the ^TV-distance. 

Note that (|4.5I) holds in particular for functions gh which depend only on 
the number of points of Thus, using (12. 3|) in combination with (|4.5I) yields 

dTy{^m),Fo{\X\))=F[^ = 6a] sup | A ^ (0) | X ^ = '^"^ , 

where A.'^g{w) = Ag{'w+1) — Ag{'w) (which corresponds to the first difference 
in (12. 4p ) and where we used the fa ct that A^g^/O) p_[Aj_"^, again obtained 
from the proof of Lemma 1.1.1 of iBarbour et al. I (Il992bl ). Thus we have 



effectively constructed an example, where the attempt to match not only the 
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number but also the location of the points introduces an additional factor 
log|A| if using the (i2-metric. 

In this light the p rese nce of the logarithm i c term in the estimates of 
Barbour et al. (1992b) and Barbour and Brown ( 19921 ) is rather natural and 
may very well be unavoidable if no further assumptions on t he measure A 



see 



Brown et al 



or on the random measure under consideration are made, 

"or the lat t er. Th is is in contrast to the conjecture in Remark 10.2.6 
of iBarbour et al.l (Il992bl l. 



5. Compound Poisson distribution 



Let A = (Ai,A2, 



be a sequence of non-negative numbers such that 



Yli^i'^^i < Then we define the compound Poisson distribution CP (A) 
to be the distribution of X]i>i^^«) where Yi ~ Po(Aj) and all the Yi are 
independent. Stein's method for direct compound Pois son approximation 
has been thoroughly i ntroduced by 
setting); but see also lArratia et al. 



Barbour et aP (Il992al ) (in a more general 
(jl99Cll ) who use the 'Poisson declumping 
heuristic' for compound Poisson approximation using Stein's method but in 
a different way. 

The Stein operator has the form 



wg[w) 



(5.1) 



and it is shown in iBarbour et al. 1 (Il992al ) that the solution to the Stein 
equation AgAiw) = 6a{w) — CP{\){A} satisfies the general bound 

fi aa; 



\9A\ 



1 ;^ 5 (5-2) 

where ||A|| = X^j^iAj. This general bound is sharp in the limit as ||A|| 
approaches zero. However if ||A|| — > oo this bound is in general useless. In 
the case where iXi ^ (i + l)Aj+i for all i there are better bounds available, 
making use of the fact t hat (15. 1|) can then be interpreted as the generator 
of a M arko v process: see IBarbour et al. ( 1992a ). but also Barbour and Xia 
(I1999I I and IBarbour et al.l (120071 1 :or non-exponential bounds using other 
techniques. 

Again, we will show that there are examples, where bounds of the form 
(|5.2p are needed. Assume to this end that Aj = f or i ^ 3 and that 
Xj = pui > for i = 1,2, where /ii -|- ^2 = 1- Then IBarbour and UtevI 
( 19981 ) show in their Example 2.2 that for the test function h{w) = (— 1)"', 
the solution g to the Stein equation satisfies 

g{l) X p-^l^ exp{p^2(l - m/(2^2))'}, 

thus, if /xi < 2/^2 (which we shall assume from now on), g{l 
tially fast as p ^ 00. 



(5.3) 

grows exponen- 



Example 5.1. Define the operator 

Big{w) = Ag{w) + ^6i{w)g{w) 

= Xig{w + 1) + 2X2g{w + 2)-{w- \5i{w))g{w). 

It is not obvious that there exists a random variable W such that MBg{W) = 
for all g, because, under our assumptions, B is not a generator of a Markov 



(5.4) 
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process. However, using the generating function approach we show below 
that if (ly) exists. 

To this end, define the sequence qo,qi, . . . by the recurrence relation 

qo = 1, qi = 2Xiqo, kqk = Xiqk-i + 2A2gfc-2 (5.5) 

for all k ^ 2. It is obvious that qk ^ for all k. Define now the generating 
function ip{z) = Ylk>o^kz'' ■ Using (|5.5p . is easy to show that if satisfies the 
differential equation 

if'iz) = Xi + {Xi + 2X2z)ip{z). (5.6) 
Straightforward calculations show that 



is a solution to (|5.6p with the initial condition (^(0) = go = 1, where k = 
^Xl/^/X2 and ip{x) = f^e~*'^dt. From 7.1.13 of lAbramowitz and Stegun 



( 19641 ) we obtain that ipix) ^ e ^^/x as a; ^ oo, thus 

J]g, = V^(l)xe^^+^^ 

k>0 



Thus, the normalised sequence pfc := qk/f{l) forms a probability distribution 
with pq X e"'' and pi x pe~P. It is straightforward to show that a random 
variable W with this distribution satisfies lEBig{W) = for all bounded g. 
Together with (|5.3p we thus finally have for such W that 



dTY{^{W),CF{pfii,pfj,2) =pi sup 1571(1)1 

^ -^exp{-p[l-/X2(l-w/(2/U2))']} 

for some constant c not depending on p. 

As we cannot access the Markov process interpretation here, it is hard to 
give an explanation on what the effect of the perturbation in (|5.4|) is. The 
specific test function in (15.31) suggest that there is some oscillatory effect 
involved. However, it is not clear whether this test function has the same as- 
ymptotic behaviour as the supremum over all solutions of the Stein equation, 
so there may well be solutions with even stronger asymptotic growth. 
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